Guiding exploration by pre-existing knowledge without modifying reward

نویسنده

  • Kary Främling
چکیده

Reinforcement learning is based on exploration of the environment and receiving reward that indicates which actions taken by the agent are good and which ones are bad. In many applications receiving even the first reward may require long exploration, during which the agent has no information about its progress. This paper presents an approach that makes it possible to use pre-existing knowledge about the task for guiding exploration through the state space. Concepts of short- and long-term memory combine guidance by pre-existing knowledge with reinforcement learning methods for value function estimation in order to make learning faster while allowing the agent to converge towards a good policy.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bi-Memory Model for Guiding Exploration by Pre-existing Knowledge

Reinforcement learning agents explore their environment in order to collect reward that allows them to learn what actions are good or bad in what situations. The exploration is performed using a policy that has to keep a balance between getting more information about the environment and exploiting what is already known about it. This paper presents a method for guiding exploration by pre-existi...

متن کامل

Dual Memory Model for Using Pre-existing Knowledge in Reinforcement Learning Tasks

Reinforcement learning agents explore their environment in order to collect reward that allows them to learn what actions are good or bad in what situations. The exploration is performed using a policy that has to keep a balance between getting more information about the environment and exploiting what is already known about it. This paper presents a method for guiding exploration by pre-existi...

متن کامل

LITHOTHEQUE knowledge system about world-s mineral deposits supported by miniaturized sample sets: call for international adoption, networking and exchange

Our civilization is based on metals, among other life supports. The existing ore deposits are becoming rapidly depleted by almost exponentially increasing demand and production and major new ore discoveries are needed. Mineral exploration is supported by modern tools and scientific ideas, but geological characteristics of orebodies and their rock associations have still to be visualized.The tim...

متن کامل

The exploration of challenges in clinical knowledge management in nurses: a qualitative study

Background and Purpose: Clinical knowledge management (CKM) is considered as a dominant approach for information management and expansion of knowledge in clinical settings. Health care executives have recently begun to focus on CKM. Therefore, identification of challenges against proper CKM planning is of paramount importance. The aim of this study was to explore challenges in clinical knowl...

متن کامل

Chasing a Moving Target: Exploitation and Exploration in Dynamic Environments

A common justification for organizational change is that the circumstances in which the organization finds itself have changed, thereby eroding the value of utilizing existing knowledge. On the surface, the claim that organizations should adapt by generating new knowledge seems obvious and compelling. However, this standard wisdom overlooks the possibility that the reward to generating new know...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Neural networks : the official journal of the International Neural Network Society

دوره 20 6  شماره 

صفحات  -

تاریخ انتشار 2007